Investigating Some Imputation Methods of Multivariate Imputation Chained Equations
نویسندگان
چکیده
This paper investigates three MICE methods: Predictive Mean Matching (PMM), Quantile Regression-based Multiple Imputation (QR-basedMI) and Simple Random Sampling (SRSI) at imputation numbers 5, 15, 20 30 with 5% 20% missing values, to ascertain the one that produces imputed values best matches observed compare model fit based on AIC MSE. The results show that; QR-basedMI produced more didn’t match observed, SRSI better as number of imputations increases while PMM matched all missingness considered. for showed in terms MSE except M=15, result M= M=15 M=20 results. shows considered both where was seen produce It is concluded comparison, most suited when but QR-basedMI.
منابع مشابه
mice: Multivariate Imputation by Chained Equations in R
The R package mice imputes incomplete multivariate data by chained equations. The software mice 1.0 appeared in the year 2000 as an S-PLUS library, and in 2001 as an R package. mice 1.0 introduced predictor selection, passive imputation and automatic pooling. This article documents mice 2.9, which extends the functionality of mice 1.0 in several ways. In mice 2.9, the analysis of imputed data i...
متن کاملMultiple Imputation by Chained Equations in Praxis: Guidelines and Review
Multiple imputation by chained equations (MICE) is an effective tool to handle missing data an almost unavoidable problem in quantitative data analysis. However, despite the empirical and theoretical evidence supporting the use of MICE, researchers in the social sciences often resort to inferior approaches unnecessarily risking erroneous results. The complexity of the decision process when enco...
متن کاملEffect of Reference Population Size and Imputation Methods on the Accuracy of Imputation in Pure and Mixed Populations
Imputation as a method of creating low-density chips to high-density chips has been introduced to increase the accuracy of genomic selection in animals. In the current study, to investing imputation accuracy, three populations of mixed (scenario 1), pure (scenario 2) and mixed + pure (scenario 3) were simulated using QMSim. Two methods of imputation including Beagle and Flmpute were used fo...
متن کاملKernel imputation with multivariate auxiliaries
We consider a data set with missing observations but known auxiliaries for the sample and develop a real donor imputation. For each unit with missing observations we construct a distribution over a set of possible donors. We want the expectation (or distribution) to be chosen so that the expectation (or distribution) of the imputed values should equal the distribution of the units’ true values....
متن کاملInfluence of Outliers on Some Multiple Imputation Methods
In the field of data quality, imputation is the most used method for handling missing data. The performance of imputation techniques is influenced by various factors, especially when data represent only a sample of population, for example the survey design characteristics. In this paper, we compare the results of different multiple imputation methods in terms of final estimates when outliers oc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: European Journal of Mathematics and Statistics
سال: 2022
ISSN: ['2736-5484']
DOI: https://doi.org/10.24018/ejmath.2022.3.3.109